Running Head: Behavioral Profiles 1 Behavioral Profiles: A fine-grained and quantitative approach in corpus-based lexical semantics

نویسنده

  • Stefan Th. Gries
چکیده

This paper introduces a fairly recent corpus-based approach to lexical semantics, the Behavioral Profile (BP)approach. After a short review of traditional corpus-based work on lexical semantics and its shortcomings, I explain the logic and methodology of the BP approach and exemplify its application to different lexical relations (polysemy, synonymy, antonymy) in English and Russian with an eye to illustrating how the BP approach allows for the incorporation of different statistical techniques. Finally, I briefly discuss how first experimental approaches that validate the BP method and outline its theoretical commitments and motivations. Introduction In this paper, I will provide an overview of a recent approach towards corpus-based lexical semantics that tries to go beyond most previous corpus-based work, the so-called Behavioral Profile approach. This remainder of this first section provides a necessarily brief and general overview of previous traditional corpus-linguistic work in lexical semantics and mentions the shortcomings of such work and how the Behavioral Profile approach attempts to address them. Lexical semantics is the domain of linguistics that has probably been studied most with corpora. The main assumption underlying nearly all corpus-based work in lexical (and constructional) semantics is that the distributional characteristics of a linguistic expression reveal many if not most of its semantic and functional properties. The maybe most widely-cited statement to this effect is Firth's (1957:11) famous dictum that "[y]ou shall know a word by the company it keeps." However, other quotes may be actually even more explicit and instructive, such as Bolinger's (1968:127) statement that "a difference in syntactic form always spells a difference in meaning" or Cruse's (1986:1) statement that "the semantic properties of a lexical item are fully reflected in appropriate aspects of the relations it contracts with actual and potential contexts." Most explicit in this regard is Harris (1970:785f.):

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Behavioral Profiles: a fine-grained and quantitative approach in corpus-based lexical semantics

The domain of linguistics that has probably been studied most with corpora is lexical semantics. The main assumption underlying nearly all corpus-based work in lexical (and constructional) semantics is that the distributional characteristics of a linguistic expression reveal many if not most of its semantic and functional properties. The maybe most widely-cited statement to this effect is Firth...

متن کامل

That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using #petpeeve Tweets

We propose a novel data augmentation approach to enhance computational behavioral analysis using social media text. In particular, we collect a Twitter corpus of the descriptions of annoying behaviors using the #petpeeve hashtags. In the qualitative analysis, we study the language use in these tweets, with a special focus on the fine-grained categories and the geographic variation of the langua...

متن کامل

Inferring Semantics from Collocation Clusters to Represent Verbs and Nouns

Current lexical semantic theories provide representations at a coarse grained level. In this paper, I will provide motivations for a fine grained representation for verbs and. nouns. An initial case study is done to serve as evidence that a more detailed representation is needed for tasks that require high accuracy rates, such as machine translation. An automatic approach to gather fine grained...

متن کامل

Behavioral profiles: A corpus-based perspective on synonymy and antonymy*

1 Introduction 1.1 Two empirical perspectives in the study of synonymy and antonymy The domain of linguistics that has arguably been studied most from a corpus-linguistic perspective is lexical, or even lexicographical, semantics. Already the early work of pioneers such as Firth and Sinclair has paved the way for the study of lexical items, their distribution, and what their distribution reveal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010